Explainable Multi-Class Classification Based on Integrative Feature Selection for Breast Cancer Subtyping

نویسندگان

چکیده

Breast cancer subtype classification is a multi-class problem that can be handled using computational methods. Three main challenges need to addressed. Consider first the high dimensionality of available datasets relative extremely small number instances. Second, integration different levels data makes even more challenging. The third challenging issue ability explain predictions provided by machine learning model. Recently, several deep models have been proposed for feature extraction and classification. However, due size datasets, they were unable achieve satisfactory results, particularly in Aside from that, explaining impact features on has not addressed previous works. To cope with these problems, we propose multi-stage selection (FS) framework two schemes. Using multi-omics data, four models, namely support vector machines, random forest, extra trees, XGBoost, investigated at each level. SHAP was used how specific influenced Experimental results demonstrated ensemble early stage improved compared baseline experiments state-of-the art Furthermore, explanations regarding implications relevant are provided, which could serve as future biological investigations.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-class feature selection for texture classification

In this paper, a multi-class feature selection scheme based on recursive feature elimination (RFE) is proposed for texture classifications. The feature selection scheme is performed in the context of one-against-all least squares support vector machine classifiers (LSSVM). The margin difference between binary classifiers with and without an associated feature is used to characterize the discrim...

متن کامل

Feature Subset Selection for Multi-class SVM Based Image Classification

Multi-class image classification can benefit much from feature subset selection. This paper extends an error bound of binary SVMs to a feature subset selection criterion for the multi-class SVMs. By minimizing this criterion, the scale factors assigned to each feature in a kernel function are optimized to identify the important features. This minimization problem can be efficiently solved by gr...

متن کامل

Direct Sparsity Optimization Based Feature Selection for Multi-Class Classification

A novel sparsity optimization method is proposed to select features for multi-class classification problems by directly optimizing a l2,p -norm ( 0 < p ≤ 1 ) based sparsity function subject to data-fitting inequality constraints to obtain large between-class margins. The direct sparse optimization method circumvents the empirical tuning of regularization parameters in existing feature selection...

متن کامل

Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets

Objective(s): This study addresses feature selection for breast cancer diagnosis. The present process uses a wrapper approach using GA-based on feature selection and PS-classifier. The results of experiment show that the proposed model is comparable to the other models on Wisconsin breast cancer datasets. Materials and Methods: To evaluate effectiveness of proposed feature selection method, we ...

متن کامل

Classification and feature selection algorithms for multi-class CGH data

UNLABELLED Recurrent chromosomal alterations provide cytological and molecular positions for the diagnosis and prognosis of cancer. Comparative genomic hybridization (CGH) has been useful in understanding these alterations in cancerous cells. CGH datasets consist of samples that are represented by large dimensional arrays of intervals. Each sample consists of long runs of intervals with losses ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2022

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math10224271